Method and for reconstructing chroma block and video decoding apparatus

ABSTRACT

A method and for reconstructing chroma blocks and a video decoding apparatus are disclosed. In accordance with one aspect of the present disclosure, provided is a method for reconstructing a chroma block of a target block to be reconstructed. The method includes decoding correlation information between first residual samples and second residual samples, the first residual sample, and prediction information of the chroma block from a bitstream, wherein the first residual samples are residual samples of a first chroma component and the second residual samples are residual samples of a second chroma component. The method further includes generating predicted samples of the first chroma component and predicted samples of the second chroma information on the basis of the prediction information, and deriving the second residual samples by applying the correlation information to the first residual samples. The method further includes reconstructing a chroma block of the first chroma component by adding the first residual samples and the predicted samples of the first chroma component and reconstructing a chroma block of the second chroma component by adding the second residual samples and predicted samples of the second chroma component.

CROSS-REFERENCE TO RELATED APPLICATION

This application is a National Phase application filed under 35 USC 371of PCT International Application No. PCT/KR2020/006432 with anInternational Filing Date of May 15, 2020, which claims under 35 U.S.C.§ 119(a) the benefit of Korean Patent Application No. 10-2019-0056974,filed on May 15, 2019, and Korean Patent Application No.10-2020-0058335, filed on May 15, 2020, the entire contents of which areincorporated herein by reference.

TECHNICAL FIELD

The present invention relates to encoding and decoding of a video, moreparticularly, to a method for reconstructing a chroma block and a videodecoding apparatus, which improve encoding and decoding efficiency byefficiently predicting residual samples of a chroma component.

BACKGROUND ART

Since video data has a large data volume compared to audio data or stillimage data, it requires a lot of hardware resources, including memory,to store or transmit the data in its raw form before undergoing acompression process.

Accordingly, storing or transmitting video data typically accompaniescompression thereof by using an encoder before a decoder can receive,decompress, and reproduce the compressed video data. Existing videocompression technologies include H.264/AVC and High Efficiency VideoCoding (HEVC), which improves the encoding efficiency of H.264/AVC byabout 40%.

However, the constant increase of video images in size, resolution, andframe rate and the resultant increase of data amount to be encodedrequire a new and superior compression technique with better encodingefficiency and higher image quality improvement over existingcompression techniques.

DISCLOSURE Technical Problem

In view of this needs, the present invention is directed to providing animproved video encoding and decoding technique. In particular, an aspectof the present invention is related to a technique for improvingencoding and decoding efficiency by deriving the other from one of a Cbchroma component and a Cr chroma component.

Technical Solution

In accordance with one aspect of the present disclosure, provided is amethod for reconstructing a chroma block of a target block to bereconstructed. The method includes decoding correlation informationbetween first residual samples and second residual samples, the firstresidual sample, and prediction information of the chroma block from abitstream, wherein the first residual samples are residual samples of afirst chroma component and the second residual samples are residualsamples of a second chroma component. The method further includesgenerating predicted samples of the first chroma component and predictedsamples of the second chroma information on the basis of the predictioninformation, and deriving the second residual samples by applying thecorrelation information to the first residual samples. The methodfurther includes reconstructing a chroma block of the first chromacomponent by adding the first residual samples and the predicted samplesof the first chroma component and reconstructing a chroma block of thesecond chroma component by adding the second residual samples andpredicted samples of the second chroma component.

In accordance with another aspect of the present disclosure, provided isa video decoding apparatus for reconstructing a chroma block of a targetblock to be reconstructed. The video decoding apparatus comprises adecoding unit configured to decode correlation information between firstresidual samples and second residual samples, the first residualsamples, and prediction information of the chroma block from abitstream, wherein the first residual samples are residual samples of afirst chroma component and the second residual samples are residualsamples of a second chroma component. The video decoding apparatusfurther comprises a prediction unit configured to generate predictedsamples of the first chroma component and predicted samples of thesecond chroma information on the basis of the prediction information, achroma component reconstruction unit configured to derive the secondresidual samples by applying the correlation information to the firstresidual samples. The video decoding apparatus further comprises anadder configured to reconstruct a chroma block of the first chromacomponent by adding the first residual samples and the predicted samplesof the first chroma component and reconstruct a chroma block of thesecond chroma component by adding the second residual samples andpredicted samples of the second chroma component.

Advantageous Effects

As described above, according to some embodiments of the presentinvention, since either one of a Cb chroma component and a Cr chromacomponent is derived without being signaled, the compression performanceof encoding and decoding is improved.

DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram illustrating a video encoding apparatus thatcan implement the techniques of the present disclosure.

FIG. 2 is a diagram for explaining a method of splitting a block byusing a QTBTTT structure.

FIG. 3A is a diagram illustrating a plurality of intra-prediction modes.

FIG. 3B is a diagram illustrating a plurality of intra-prediction modesincluding wide-angle intra-prediction modes.

FIG. 4 is a block diagram illustrating a video decoding apparatus thatcan implement the techniques of the present disclosure.

FIG. 5 is an exemplary block diagram of a video encoding apparatus canimplement an example of a residual block reconstruction method forchroma components.

FIG. 6 is a flowchart illustrating an example of a residual blockreconstruction method for chroma components implemented in the videoencoding apparatus of FIG. 5 .

FIG. 7 is a block diagram illustrating a video decoding apparatus thatcan implement an example of a residual block reconstruction method forchroma components.

FIG. 8 is a flowchart illustrating an example of a residual blockreconstruction method for chroma components implemented in the videodecoding apparatus of FIG. 7 .

FIG. 9 is an exemplary block diagram of a video encoding apparatuscapable of implementing another example of a residual blockreconstruction method for chroma components.

FIG. 10 is a flowchart illustrating an example of a residual blockreconstruction method for chroma components implemented in the videoencoding apparatus of FIG. 9 .

FIG. 11 is a block diagram illustrating a video decoding apparatus thatcan implement another example of a residual block reconstruction methodfor chroma components.

FIG. 12 is a flowchart illustrating an example of a residual blockreconstruction method implemented in the video decoding apparatus ofFIG. 11 .

FIGS. 13 and 14 are flowcharts illustrating other examples of a residualblock reconstruction method for chroma components.

FIG. 15 is a flowchart illustrating an example of performance conditionsof a residual reconstruction method for chroma components.

DETAILED DESCRIPTION

Hereinafter, some embodiments of the present disclosure will bedescribed in detail with reference to the accompanying drawings. In thefollowing description, like reference numerals preferably designate likeelements, although the elements are shown in different drawings.Further, in the following description of some embodiments, a detaileddescription of related known components and functions when considered toobscure the subject of the present disclosure will be omitted for thepurpose of clarity and for brevity.

FIG. 1 is a block diagram illustrating a video encoding apparatus thatcan implement the techniques of the present disclosure. Hereinafter, avideo encoding apparatus and elements of the apparatus will be describedwith reference to FIG. 1 .

The video encoding apparatus includes a picture splitter 110, apredictor 120, a subtractor 130, a transformer 140, a quantizer 145, arearrangement unit 150, an entropy encoder 155, an inverse quantizer160, an inverse transformer 165, an adder 170, a filter unit 180, and amemory 190.

Each element of the video encoding apparatus may be implemented inhardware or software, or a combination of hardware and software. Thefunctions of the respective elements may be implemented as software, anda microprocessor may be implemented to execute the software functionscorresponding to the respective elements.

One video includes a plurality of pictures. Each picture is split into aplurality of regions, and encoding is performed on each region. Forexample, one picture is split into one or more tiles or/and slices.Here, the one or more tiles may be defined as a tile group. Each tile orslice is split into one or more coding tree units (CTUs). Each CTU issplit into one or more coding units (CUs) by a tree structure.Information applied to each CU is encoded as a syntax of the CU, andinformation applied to CUs included in one CTU in common is encoded as asyntax of the CTU. In addition, information applied to all blocks in oneslice in common is encoded as a syntax of a slice header, andinformation applied to all blocks constituting a picture is encoded in apicture parameter set (PPS) or a picture header. Furthermore,information which a plurality of pictures refers to in common is encodedin a sequence parameter set (SPS). In addition, information referred toby one or more SPSs in common is encoded in a video parameter set (VPS).Information applied to one tile or tile group in common may be encodedas a syntax of a tile or tile group header.

The picture splitter 110 determines the size of a coding tree unit(CTU). Information about the size of the CTU (CTU size) is encoded as asyntax of the SPS or PPS and is transmitted to the video decodingapparatus.

The picture splitter 110 splits each picture constituting the video intoa plurality of CTUs having a predetermined size, and then recursivelysplits the CTUs using a tree structure. In the tree structure, a leafnode serves as a coding unit (CU), which is a basic unit of coding.

The tree structure may be a QuadTree (QT), in which a node (or parentnode) is split into four sub-nodes (or child nodes) of the same size, aBinaryTree (BT), in which a node is split into two sub-nodes, aTernaryTree (TT), in which a node is split into three sub-nodes at aratio of 1:2:1, or a structure formed by a combination of two or more ofthe QT structure, the BT structure, and the TT structure. For example, aQuadTree plus BinaryTree (QTBT) structure may be used, or a QuadTreeplus BinaryTree TernaryTree (QTBTTT) structure may be used. Here, BTTTmay be collectively referred to as a multiple-type tree (MTT).

FIG. 2 exemplarily shows a QTBTTT splitting tree structure. As shown inFIG. 2 , a CTU may be initially split in the QT structure. The QTsplitting may be repeated until the size of the splitting block reachesthe minimum block size MinQTSize of a leaf node allowed in the QT. Afirst flag (QT_split_flag) indicating whether each node of the QTstructure is split into four nodes of a lower layer is encoded by theentropy encoder 155 and signaled to the video decoding apparatus. Whenthe leaf node of the QT is not larger than the maximum block size(MaxBTSize) of the root node allowed in the BT, it may be further splitinto one or more of the BT structure or the TT structure. The BTstructure and/or the TT structure may have a plurality of splittingdirections. For example, there may be two directions, namely, adirection in which a block of a node is horizontally split and adirection in which the block is vertically split. As shown in FIG. 2 ,when MTT splitting is started, a second flag (mtt_split_flag) indicatingwhether nodes are split, a flag indicating a splitting direction(vertical or horizontal) in the case of splitting, and/or a flagindicating a splitting type (Binary or Ternary) are encoded by theentropy encoder 155 and signaled to the video decoding apparatus.

Alternatively, prior to encoding the first flag (QT_split_flag)indicating whether each node is split into 4 nodes of a lower layer, aCU splitting flag (split_cu_flag) indicating whether the node is splitmay be encoded. When the value of the CU split flag (split_cu_flag)indicates that splitting is not performed, the block of the node becomesa leaf node in the splitting tree structure and serves a coding unit(CU), which is a basic unit of encoding. When the value of the CU splitflag (split_cu_flag) indicates that splitting is performed, the videoencoding apparatus starts encoding the flags in the manner describedabove, starting with the first flag.

When QTBT is used as another example of a tree structure, there may betwo splitting types, which are a type of horizontally splitting a blockinto two blocks of the same size (i.e., symmetric horizontal splitting)and a type of vertically splitting a block into two blocks of the samesize (i.e., symmetric vertical splitting). A split flag (split_flag)indicating whether each node of the BT structure is split into block ofa lower layer and splitting type information indicating the splittingtype are encoded by the entropy encoder 155 and transmitted to the videodecoding apparatus. There may be an additional type of splitting a blockof a node into two asymmetric blocks. The asymmetric splitting type mayinclude a type of splitting a block into two rectangular blocks at asize ratio of 1:3, or a type of diagonally splitting a block of a node.

CUs may have various sizes according to QTBT or QTBTTT splitting of aCTU. Hereinafter, a block corresponding to a CU (i.e., a leaf node ofQTBTTT) to be encoded or decoded is referred to as a “current block.” AsQTBTTT splitting is employed, the shape of the current block may besquare or rectangular.

The predictor 120 predicts the current block to generate a predictionblock. The predictor 120 includes an intra-predictor 122 and aninter-predictor 124.

In general, each of the current blocks in a picture may be predictivelycoded. In general, prediction of a current block is performed using anintra-prediction technique (using data from a picture containing thecurrent block) or an inter-prediction technique (using data from apicture coded before a picture containing the current block). Theinter-prediction includes both unidirectional prediction andbi-directional prediction.

The intra-prediction unit 122 predicts pixels in the current block usingpixels (reference pixels) positioned around the current block in thecurrent picture including the current block. There is a plurality ofintra-prediction modes according to the prediction directions. Forexample, as shown in FIG. 3A, the plurality of intra-prediction modesmay include two non-directional modes, which include a planar mode and aDC mode, and 65 directional modes. Neighboring pixels and an equation tobe used are defined differently for each prediction mode. The tablebelow lists intra-prediction mode numbers and names thereof.

For efficient directional prediction for a rectangular-shaped currentblock, directional modes (intra-prediction modes 67 to 80 and −1 to −14)indicated by dotted arrows in FIG. 3B may be additionally used. Thesemodes may be referred to as “wide angle intra-prediction modes.” In FIG.3B, arrows indicate corresponding reference samples used for prediction,not indicating prediction directions. The prediction direction isopposite to the direction indicated by an arrow. A wide-angle intraprediction mode is a mode in which prediction is performed in adirection opposite to a specific directional mode without additional bittransmission when the current block has a rectangular shape. In thiscase, among the wide angle intra-prediction modes, some wide angleintra-prediction modes available for the current block may be determinedbased on a ratio of the width and height of the rectangular currentblock. For example, wide angle intra-prediction modes with an angle lessthan 45 degrees (intra prediction modes 67 to 80) may be used when thecurrent block has a rectangular shape with a height less than the widththereof. Wide angle intra-prediction modes with an angle greater than−135 degrees (intra-prediction modes −1 to −14) may be used when thecurrent block has a rectangular shape with width greater than the heightthereof.

The intra-predictor 122 may determine an intra-prediction mode to beused in encoding the current block. In some examples, theintra-predictor 122 may encode the current block using severalintra-prediction modes and select an appropriate intra-prediction modeto use from the tested modes. For example, the intra-predictor 122 maycalculate rate distortion values using rate-distortion analysis ofseveral tested intra-prediction modes, and may select anintra-prediction mode that has the best rate distortion characteristicsamong the tested modes.

The intra-predictor 122 selects one intra-prediction mode from among theplurality of intra-prediction modes, and predicts the current blockusing neighboring pixels (reference pixels) and an equation determinedaccording to the selected intra-prediction mode. Information about theselected intra-prediction mode is encoded by the entropy encoder 155 andtransmitted to the video decoding apparatus.

The inter-predictor 124 generates a prediction block for the currentblock through motion compensation. The inter-predictor 124 searches fora block most similar to the current block in a reference picture whichhas been encoded and decoded earlier than the current picture, andgenerates a prediction block for the current block using the searchedblock. Then, the inter-predictor generates a motion vector correspondingto a displacement between the current block in the current picture andthe prediction block in the reference picture. In general, motionestimation is performed on a luma component, and a motion vectorcalculated based on the luma component is used for both the lumacomponent and the chroma component. The motion information includinginformation about the reference picture and information about the motionvector used to predict the current block is encoded by the entropyencoder 155 and transmitted to the video decoding apparatus.

The subtractor 130 subtracts the prediction block generated by theintra-predictor 122 or the inter-predictor 124 from the current block togenerate a residual block.

The transformer 140 partitions the residual block into one or moretransform blocks, performs a transform on the transform blocks, andtransforms the residual values of the transform blocks from a pixeldomain into a frequency domain. In the frequency domain, the transformblocks are referred to as coefficient blocks containing one or moretransform coefficient values. A two-dimensional (2D) transform kernelmay be used for the transform, and a one-dimensional (1D) transformkernel may be used for each of horizontal transform and verticaltransform. The transform kernels may be based on a discrete cosinetransform (DCT), a discrete sine transform (DST), or the like.

The transformer 140 may transform the residual signals in a residualblock by using the entire size of the residual block as a transformunit. Also, the transformer 140 may partition the residual block intotwo sub-blocks in a horizontal or vertical direction and may perform thetransform on only one of the two sub-blocks. Accordingly, the size ofthe transform block may be different from the size of the residual block(and thus the size of a prediction block). Non-zero residual samplevalues may be absent or very sparse in untransformed sub-block. Residualsamples of the untransformed sub-block may not be signaled and may allbe regarded as “0” by a video decoding apparatus. Several partitiontypes may be present depending on a partitioning direction and apartitioning ratio. The transformer 140 may provide information on acoding mode (or a transform mode) of the residual block (e.g., theinformation on the coding mode includes information indicating whetherthe residual block is transformed or the sub-block of the residual blockis transformed, information indicating a partition type selected topartition the residual block into the sub-blocks, information foridentifying the sub-block to be transformed, etc.) to the entropyencoder 155. The entropy encoder 155 may encode the information on acoding mode (or a transform mode) of a residual block.

The quantizer 145 quantizes transform coefficients output from thetransformer 140 and outputs quantized transform coefficients to theentropy encoder 155. The quantizer 145 may directly quantize a relatedresidual block for a certain block or frame without transform.

The rearrangement unit 150 may perform rearrangement of the coefficientvalues with the quantized transform coefficients. The rearrangement unit150 may use coefficient scanning for changing the two-dimensionalcoefficient array into a one-dimensional coefficient sequence. Forexample, the rearrangement unit 150 may scan coefficients from a DCcoefficient toward coefficients in a high-frequency region through azig-zag scan or a diagonal scan to output a one-dimensional coefficientsequence. Depending on the size of the transform unit and theintra-prediction mode, the zig-zag scan used may be replaced by avertical scan for scanning the two-dimensional coefficient array in acolumn direction and a horizontal scan for scanning the two-dimensionalblock shape coefficients in a row direction. In other words, a scanningmethod to be used may be determined among a zig-zag scan, a diagonalscan, a vertical scan, and a horizontal scan according to the size ofthe transform unit and the intra-prediction mode.

The entropy encoder 155 encodes a sequence of the one-dimensionalquantized transform coefficients outputted from the rearrangement unit150 by using various encoding methods such as Context-based AdaptiveBinary Arithmetic Code (CABAC), Exponential Golomb, and the like,encoding to generate a bitstream.

The entropy encoder 155 encodes information such as a CTU size, a CUsplit flag, a QT split flag, an MTT splitting type, and an MTT splittingdirection, which are associated with block splitting, such that thevideo decoding apparatus may split the block in the same manner as inthe video encoding apparatus. In addition, the entropy encoder 155encodes information about a prediction type indicating whether thecurrent block is encoded by intra-prediction or inter-prediction, andencodes intra-prediction information (i.e., information about anintra-prediction mode) or inter-prediction information (informationabout a reference picture index and a motion vector) according to theprediction type.

The inverse quantizer 160 inversely quantizes the quantized transformcoefficients output from the quantizer 145 to generate transformcoefficients. The inverse transformer 165 transforms the transformcoefficients output from the inverse quantizer 160 from the frequencydomain to the spatial domain and reconstructs the residual block.

The adder 170 adds the reconstructed residual block to the predictionblock generated by the predictor 120 to reconstruct the current block.The pixels in the reconstructed current block are used as referencepixels in performing intra-prediction of a next block.

The filter unit 180 filters the reconstructed pixels to reduce blockingartifacts, ringing artifacts, and blurring artifacts generated due toblock-based prediction and transform/quantization. The filter unit 180may include a deblocking filter 182 and a pixel adaptive offset (SAO)filter 184.

The deblocking filter 180 filters the boundary between the reconstructedblocks to remove blocking artifacts caused by block-by-blockcoding/decoding, and the SAO filter 184 performs additional filtering onthe deblocking-filtered video. The SAO filter 184 is a filter used tocompensate for a difference between a reconstructed pixel and anoriginal pixel caused by lossy coding.

The reconstructed blocks filtered through the deblocking filter 182 andthe SAO filter 184 are stored in the memory 190. Once all blocks in onepicture are reconstructed, the reconstructed picture may be used as areference picture for inter-prediction of blocks in a picture to beencoded next.

FIG. 4 is an exemplary functional block diagram of a video decodingapparatus capable of implementing the techniques of the presentdisclosure. Hereinafter, the video decoding apparatus and its componentswill be described with reference to FIG. 4 .

The video decoding apparatus may include an entropy decoder 410, arearrangement unit 415, an inverse quantizer 420, an inverse transformer430, a predictor 440, an adder 450, a filter unit 460, and a memory 470.

Similar to the video encoding apparatus of FIG. 1 , each element of thevideo decoding apparatus may be implemented in hardware, software, or acombination of hardware and software. Further, the function of eachelement may be implemented in software, and the microprocessor may beimplemented to execute the function of software corresponding to eachelement.

The entropy decoder 410 determines a current block to be decoded bydecoding a bitstream generated by the video encoding apparatus andextracting information related to block splitting, and extractsprediction information and information about a residual signal, and thelike required to reconstruct the current block.

The entropy decoder 410 extracts information about the CTU size from thesequence parameter set (SPS) or the picture parameter set (PPS),determines the size of the CTU, and splits a picture into CTUs of thedetermined size. Then, the decoder determines the CTU as the uppermostlayer, that is, the root node of a tree structure, and extractssplitting information about the CTU to split the CTU using the treestructure.

For example, when the CTU is split using a QTBTTT structure, a firstflag (QT_split_flag) related to splitting of the QT is extracted tosplit each node into four nodes of a sub-layer. For a node correspondingto the leaf node of the QT, the second flag (MTT_split_flag) andinformation about a splitting direction (vertical/horizontal) and/or asplitting type (binary/ternary) related to the splitting of the MTT areextracted to split the corresponding leaf node in the MTT structure.Thereby, each node below the leaf node of QT is recursively split in aBT or TT structure.

As another example, when a CTU is split using the QTBTTT structure, a CUsplit flag (split_cu_flag) indicating whether to split a CU may beextracted. When the corresponding block is split, the first flag(QT_split_flag) may be extracted. In the splitting operation, zero ormore recursive MTT splitting may occur for each node after zero or morerecursive QT splitting. For example, the CTU may directly undergo MTTsplitting without the QT splitting, or undergo only QT splittingmultiple times.

As another example, when the CTU is split using the QTBT structure, thefirst flag (QT_split_flag) related to QT splitting is extracted, andeach node is split into four nodes of a lower layer. Then, a split flag(split_flag) indicating whether a node corresponding to a leaf node ofQT is further split in the BT and the splitting direction informationare extracted.

Once the current block to be decoded is determined through splitting inthe tree structure, the entropy decoder 410 extracts information about aprediction type indicating whether the current block is intra-predictedor inter-predicted. When the prediction type information indicatesintra-prediction, the entropy decoder 410 extracts a syntax element forthe intra-prediction information (intra-prediction mode) for the currentblock. When the prediction type information indicates inter-prediction,the entropy decoder 410 extracts a syntax element for theinter-prediction information, that is, information indicating a motionvector and a reference picture referred to by the motion vector.

Meanwhile, the entropy decoder 410 extracts the information on a codingmode of a residual block (e.g., information on whether a residual blockis encoded or only a sub-block of a residual block is encoded,information indicating a partition type selected to partition a residualbock into sub-blocks, information for identifying encoded residualsub-blocks, quantization parameters, etc.) from a bitstream. Also, theentropy decoder 410 extracts information on quantized transformcoefficients of the current block as information regarding a residualsignal.

The rearrangement unit 415 may change the sequence of the quantized 1Dtransform coefficients entropy-decoded by the entropy decoder 410 backinto a 2D array of coefficients (i.e., a block) in the reverse order ofcoefficient scanning performed by the video encoding apparatus.

The inverse quantizer 420 inversely quantizes the quantized transformcoefficients, and the inverse transformer 430 generates a reconstructedresidual block for the current block by reconstructing residual signalsby inversely transforming the inversely quantized transform coefficientsfrom a frequency domain to a spatial domain on the basis of theinformation on a coding mode of a residual block.

When the information on a coding mode of a residual block indicates thata residual block of the current block is encoded in the video encodingapparatus, the inverse transformer 430 generates a reconstructedresidual block for the current block by performing inverse transform onthe inversely quantized transform coefficients using the size of thecurrent block (and thus the size of a residual block to be restored) asa transform unit.

Also, when the information on a coding mode of a residual blockindicates that only one sub-block of a residual block is encoded in thevideo encoding apparatus, the inverse transformer 430 generates areconstructed residual block for the current block by reconstructingresidual signals for a transformed sub-block through inverse transformon the inversely quantized transform coefficients using the size of thetransformed sub-block as a transform unit and by setting residualsignals for an untransformed sub-block to “0.”

The predictor 440 may include an intra-predictor 442 and aninter-predictor 444. The intra-predictor 442 is activated when theprediction type of the current block is intra-prediction, and theinter-predictor 444 is activated when the prediction type of the currentblock is inter-prediction.

The intra-predictor 442 determines an intra-prediction mode of thecurrent block among a plurality of intra-prediction modes based on thesyntax element for the intra-prediction mode extracted from the entropydecoder 410, and predicts the current block using the reference pixelsaround the current block according to the intra-prediction mode.

The inter-predictor 444 determines a motion vector of the current blockand a reference picture referred to by the motion vector using thesyntax element for the inter-prediction mode extracted from the entropydecoder 410, and predicts the current block based on the motion vectorand the reference picture.

The adder 450 reconstructs the current block by adding the residualblock output from the inverse transformer 430 and the prediction blockoutput from the inter-predictor 444 or the intra-predictor 442. Thepixels in the reconstructed current block are used as reference pixelsin intra-predicting a block to be decoded next.

The filter unit 460 may include a deblocking filter 462 and an SAOfilter 464. The deblocking filter 462 deblocking-filters the boundarybetween the reconstructed blocks to remove blocking artifacts caused byblock-by-block decoding. The SAO filter 464 performs additionalfiltering on the reconstructed block after deblocking filtering tocorresponding offsets so as to compensate for a difference between thereconstructed pixel and the original pixel caused by lossy coding. Thereconstructed block filtered through the deblocking filter 462 and theSAO filter 464 is stored in the memory 470. When all blocks in onepicture are reconstructed, the reconstructed picture is used as areference picture for inter-prediction of blocks in a picture to beencoded next.

In a conventional video encoding/decoding method, in order to reduce thecomplexity of prediction of chroma components, each of the chromacomponents is predicted in the same manner as a prediction process for aluma component, or each of the chroma components is predicted in asimplified way of the prediction process for the luma component.However, such a conventional method has a problem in that colordistortion occurs.

The present disclosure proposes encoding and decoding methods foreffectively predicting a chroma component in a chroma block of a targetblock to be reconstructed (i.e., a current block).

The methods proposed herein are methods in which information on residualsamples (or residual signals) of one of a Cb chroma component and a Crchroma component is coded and signaled, and information on residualsamples of the other one is derived without being coded and signaled.

Herein, residual samples of a chroma component to be derived may bereferred to as “second residual samples of a second chroma component”,and residual samples of a chroma component to be coded and signaled toderive the second residual samples may be referred to as “first residualsamples of a first chroma component”.

The first chroma component may be one of the Cb chroma component and theCr chroma component, and the second chroma component may be the other ofthe Cb chroma component and the Cr chroma component. For example, whenthe residual samples of the Cb chroma component are coded and signaledand the residual samples of the Cr chroma component are derived, theresidual samples of the Cb chroma component may be referred to as firstresidual samples, and the residual samples of the Cr chroma componentmay be referred to as second residual samples. As another example, whenthe residual samples of the Cr chroma component are coded and signaledand the residual samples of the Cb chroma component are derived, theresidual samples of the Cr chroma component may be referred to as firstresidual samples, and the residual samples of the Cb chroma componentmay be referred to as second residual samples.

A method of deriving the second residual samples may be classifiedinto 1) embodiments in which information on a correlation between thefirst residual samples and the second residual samples is used, 2)embodiments in which whether to activate or apply asecond-residual-sample derivation scheme is determined, and the like.Also, the embodiments in which the correlation information is used maybe classified into different embodiments depending on whether aninter-chroma difference value is used. Hereinafter, terms used hereinwill be defined first, and then each embodiment will be described indetail.

Correlation Information

Correlation information refers to information for deriving secondresidual samples from first residual samples and may be adaptivelydetermined according to the range of a luma component value of thecurrent block to be encoded. The correlation information may includemultiplication information or may include multiplication information andoffset information.

The correlation information may be defined at various positions in abitstream and signaled to a video decoding apparatus and may be decodedfrom the positions in the bitstream. For example, the correlationinformation may be defined and signaled at one or more positions amonghigh-level syntaxes (HLS) such as SPS, PPS, and picture level. Asanother example, the correlation information may be signaled at a lowerlevel such as a tile group level, a tile level, a CTU level, a unitblock level (CU, TU, PU), and the like. As another example, a differencevalue (difference correlation information) with correlation informationsignaled through HLS may be signaled at a lower level.

According to an embodiment, the correlation information may not bedirectly signaled, but some information from which the correlationinformation can be derived in the video decoding apparatus may besignaled. For example, table information including fixed values for thecorrelation information may be signaled, and an index value indicatingcorrelation information used to derive a second residual sample amongthe fixed values in the table information may be signaled. As anotherexample, the table information may not be signaled, but may bepredefined between a video encoding apparatus and a video decodingapparatus. The index value may be defined and signaled at one or more ofa tile group level, a tile level, and a unit block level.

The correlation information is information used to derive a secondresidual sample and thus may be signaled when the derivation of a secondresidual sample is applied. Accordingly, the correlation information maybe decoded from the bitstream when a first syntax element, which will bedescribed below, indicates that the derivation of the second residualsample is allowed or may be decoded from the bitstream when a secondsyntax element, which will be described below, indicates that thederivation of the second residual same is applied.

Multiplication Information

Multiplication information refers to information for indicating amultiplication factor between the first residual samples and the secondresidual samples. When the multiplication factor is applied to (a valueof) the first residual sample, a value equal to (a value of) the secondresidual sample or a value within a range corresponding to the secondresidual sample may be derived. The multiplication factor may representa scaling relationship, a weight relationship, a sign relationship, etc,between the first residual samples and the second residual samples.Accordingly, the multiplication factor may be an integer such as −1 or 1or a fraction such as ½ or −½.

When the multiplication information is signaled in the form of a flag of0 or 1 and the multiplication factor represents the sign relationshipbetween the first residual samples and the second residual samples, themultiplication information may represent the multiplication factorthrough the method shown in Equation 1.Multiplication Factor=1−2·(Multiplication Information)  [Equation 1]

Multiplication Information (i.e., a flag) equal to 0 indicates that thefirst residual samples and the second residual samples have the samesign relationship, and a multiplication factor of “1” may be applied tothe first residual samples. Multiplication Information (i.e., a flag)equal to 1 indicates that the first residual samples and the secondresidual samples have different sign relationships, and a multiplicationfactor of “−1” may be applied to the first residual samples.

Offset Information

Offset information refers to information for indicating an offset factorbetween the first residual sample (to which the multiplication factor isapplied) and the second residual sample. When the offset factor isapplied to the first residual sample to which the multiplication factoris applied, a value equal to the second residual sample or a valuewithin a range corresponding to the second residual sample may bederived. The offset factor may be an integer such as −1, 0, or 1 or afraction such as ½ or −½.

In relation to the case of an offset factor equal to 0, the offsetinformation may not be signaled if only multiplication information isincluded in the correlation information, and the offset information mayindicate that the offset factor equals to 0 if the multiplicationinformation and the offset information are included in the correlationinformation.

Inter-Chroma Difference Value

An inter-chroma difference value refers to a difference value betweenthe first residual sample and the second residual sample (i.e., refersto a value obtained by subtraction between the first residual sample andthe second residual sample). More specifically, the inter-chromadifference value corresponds to a value derived by subtraction betweenthe first residual sample to which the correlation information isapplied and the second residual sample. For example, if the correlationinformation includes only the multiplication information, theinter-chroma difference value may be derived by performing subtractionbetween the first residual sample to which the multiplication factor isapplied and the second residual sample. As another example, if thecorrelation information includes the multiplication information and theoffset information, the inter-chroma difference value may be derived byperforming subtraction between the first residual sample to which themultiplication factor and the offset factor are applied and the secondresidual sample.

Embodiment 1

Embodiment 1 is a method of using both correlation information andinter-chroma difference values. Embodiment 1 may be divided into thefollowing sub-embodiments, depending on a step of the encoding steps atwhich a process of deriving inter-chroma difference values andcorrelation information is performed and depending on a step of thedecoding steps at which a process of deriving second residual samples isperformed.

Embodiment 1-1

In Embodiment 1-1, a process of deriving inter-chroma difference valuesand correlation information is performed before a step of transformingresidual samples, and a process of deriving second residual samples isperformed after a step of inversely transforming residual samples.

An exemplary block diagram and flowchart of a video encoding apparatusfor performing Embodiment 1-1 are shown in FIGS. 5 and 6 , respectively,and an exemplary block diagram and flowchart of a video decodingapparatus for performing Embodiment 1-1 are shown in FIGS. 7 and 8 ,respectively.

A subtractor 130 may obtain the first residual samples and the secondresidual samples (S610). Specifically, the first residual samples may beobtained by subtracting between a prediction block (or predictedsamples) of a first chroma component and a chroma block of the firstchroma component, and the second residual samples may be obtained bysubtracting a prediction block of a second chroma component and a chromablock of the second chroma component. The prediction blocks of thechroma components may be derived through prediction of a predictor 120,and information used for the prediction, i.e., prediction informationmay be derived in this process. The process of generating predictedsamples and the process of deriving prediction information may beequally applied to other embodiments of the present specification.

A chroma component predictor 510 may determine whether to derive thesecond residual samples from the first residual samples (S620).

The chroma component predictor 510 may determine, for the chroma blocks,one of a method in which both the first residual samples and the secondresidual samples are coded (i.e., a general method) and a method inwhich the second residual samples is derived (i.e., asecond-residual-sample derivation method). For example, the chromacomponent predictor 510 calculates rate-distortion values throughrate-distortion analysis on the general method and the derivationmethod, and may select or determine one method having the bestrate-distortion characteristics for the chroma blocks. The process ofdetermining whether to derive the second residual samples may be equallyapplied to other embodiments of the present specification.

The chroma component predictor 510 may modify the first residual sampleswhen the second-residual-sample derivation method (i.e., the method inwhich the second residual samples are derived) is selected for thechroma blocks (S630). The modification of the first residual samples maybe achieved by applying the correlation information to the firstresidual samples.

The chroma component predictor 510 may derive inter-chroma differencevalues using the modified first residual samples and the second residualsamples (S640). The inter-chroma difference value may be derived bysubtracting the modified first residual sample and the second residualsample.

Operation S630 and operation S640 may be performed through Equation 2below.Cro_r=Cro_resi2−(W*Cro_resi1+Offset)  [Equation 2]

In Equation 2, Cro_resi1 denotes first residual sample, Cro_resi2denotes a second residual sample, Cro_r denotes an inter-chromadifference value, W denotes a multiplication factor, and Offset denotesan offset factor. Referring back to Equation 2 focusing on the secondresidual sample, Cro_resi2 may be a primary signal of the secondresidual sample (i.e., a primary residual signal of the second chromacomponent), and Cro_r may be a secondary signal of the second residualsample (i.e., a secondary residual signal of the second chromacomponent).

The transformer 140 may transform the inter-chroma difference values andthe first residual samples, and the quantizer 145 may quantize thetransformed inter-chroma difference values and the transformed firstresidual samples (S650). Here, the inter-chroma difference values may bequantized through a “quantization parameter that is changed usingQP_C_offset” from a quantization parameter of the first residual samplesor the luma component. QP_C_offset may be determined by various methods.For example, QP_C_offset may be adaptively determined according to oneor more of the range of a luma component value (the range of abrightness value), the size of a chroma block, and the ranges ofquantization parameters of a luma component. As another example,QP_C_offset may be determined as a value preset in a video encodingapparatus and a video decoding apparatus. As another example, the videoencoding apparatus may determine QP_C_offset as an arbitrary value,perform a quantization process, and signal a value of QP_C_offset usedin the quantization process to the video decoding apparatus. Thequantization method using QP_C_offset may also be applied to otherembodiments of the present specification.

The transformed and quantized inter-chroma difference values, firstresidual samples, correlation information, and prediction informationmay be encoded and signaled to the video decoding apparatus (S660).Here, the second residual samples are not signaled.

The entropy decoder 410 may decode the inter-chroma difference values,the first residual samples, the correlation information, and theprediction information from a bitstream (S810). The inverse quantizer420 inversely quantizes the inter-chroma difference values and the firstresidual samples, and the inverse transformer 430 may inverselytransform the inversely quantized inter-chroma difference values and theinversely quantized first residual samples (S820).

The predictor 440 may generate (or reconstruct) the predicted samples(or predictive block) of the first chroma component and the predictedsamples of the second chroma component on the basis of the predictioninformation (S820).

A chroma component reconstruction unit 710 may determine whether toderive the second residual samples from the first residual samples(whether to activate (allow) and/or apply the second-residual-samplederivation method) (S830). A detailed description of operation S830 willbe described below through a separate embodiment.

The chroma component reconstruction unit 710 may modify the firstresidual samples using the (inversely transformed) correlationinformation when it is determined to derive the second residual samples(S840). Also, the chroma component reconstruction unit 710 may derivethe second residual samples using the modified first residual samplesand the inversely transformed inter-chroma difference values (S850). Thesecond residual samples may be derived by adding the modified firstresidual samples and the inversely transformed inter-chroma differencevalues.

Operation S630 and operation S640 may be performed through Equation 3below.Cro_resi2=(W*Cro_resi1+Offset)+Cro_r  [Equation 3]

The adder 450 may reconstruct the chroma block of the first chromacomponent by adding the first residual samples and the prediction blockof the first chroma component and may reconstruct the chroma block ofthe second chroma component by adding the derived second residualsamples and the prediction block of the second chroma component (S860).

Embodiment 1-2

In Embodiment 1-2, a process of deriving the inter-chroma differencevalues and correlation information is performed after a step ofquantizing residual samples, and a process of deriving the secondresidual samples is performed before a step of inversely quantizingresidual samples.

An exemplary block diagram and flowchart of a video encoding apparatusfor performing Embodiment 1-2 are shown in FIGS. 9 and 10 ,respectively, and an exemplary block diagram and flowchart of a videodecoding apparatus for performing Embodiment 1-2 are shown in FIGS. 11and 12 , respectively.

A subtractor 130 may obtain a first residual samples and a secondresidual samples (S1010). Residual samples of each of chroma componentsmay be acquired by subtracting the prediction block and the chroma blockof each of the chroma components, and the prediction block and theprediction information of each of the chroma components is derivedthrough the prediction process of a predictor 120.

A transformer 140 may transform the first residual samples and thesecond residual samples, and a quantizer 145 may quantize thetransformed first residual samples and the transformed second residualsamples (S1020). Here, the second residual samples may be quantized as avalue obtained by adding a quantization offset for quantization of thesecond residual samples to a quantization parameter of the firstresidual samples. The quantization offset may be determined by variousmethods. For example, the quantization offset may be adaptivelydetermined according to one or more of the range of a luma componentvalue (the range of a brightness value), the size of the first residualsample values, and the bit-depth of the second residual samples. Asanother example, the quantization offset may be determined as a valuepreset in a video encoding apparatus and a video decoding apparatus. Thevideo decoding apparatus may determine a quantization parameter usingdelta-QP signaled from the video encoding apparatus, add thequantization offset to the quantization parameter to derive aquantization parameter of the second residual samples, and theninversely quantize the second residual samples using the derivedquantization parameter. The quantization/inverse quantization methodusing the quantization offset may be applied to other embodiments of thepresent specification.

According to an embodiment, quantization coefficients of “0” may bederived through a quantization process for the second residual samples(i.e., there may be no residual signal in the quantization process). Inthis case, information or a syntax element indicating that quantizationcoefficients of “0” are derived may be signaled from the video encodingapparatus to the video decoding apparatus.

Meanwhile, one or more of a quantization parameter value of the firstresidual sample (to which the quantization offset is not added) (firstvalue), a value obtained by adding the quantization offset to thequantization parameter of the first residual sample (second value), andthe average of the first value and the second value may be used in anin-loop filtering process for the second residual sample. For example,one or more of the first value, the second value, and the average may beused as a parameter for determining the in-loop filtering strength ofthe second residual sample or may be used as a parameter for determiningan index in a table for determining boundary strength. A method in whichone or more of the first value, the second value, and the average valueare used in the in-loop filtering process may be applied to otherembodiments of the present specification.

The chroma component predictor 510 may determine whether to derive thesecond residual samples from the first residual samples (S1030). Thechroma component predictor 510 may modify the quantized first residualsamples when it is determined to derive the second residual samples(S1040). The modification of the first residual samples may be performedby applying the correlation information to the quantized first residualsamples.

The chroma component predictor 510 may derive an inter-chroma differencevalues using the modified first residual samples and the quantizedsecond residual samples (S1050). The inter-chroma difference values maybe derived by performing a subtraction between the modified firstresidual samples and the quantized second residual samples.

Operation S1040 and operation S1050 may be performed through Equation 4below.Q(T(Cro_r))=Q(T(Cro_resi2))−(W*Q(T(Cro_resi1))+Offset)  [Equation 4]

In Equation 4, Q(T(Cro_resi1)) denotes the transformed and quantizedfirst residual samples, Q(T(Cro_resi2)) denotes the transformed andquantized second residual samples, and Q(T(Cro_r)) denotes theinter-chroma difference values derived from the transformed andquantized first residual samples and the transformed and quantizedsecond residual samples.

The inter-chroma difference values, the first residual samples, thecorrelation information, and the prediction information may be encodedand signaled to the video decoding apparatus (S1060). Here, the secondresidual samples are not signaled.

The entropy decoder 410 may decode the inter-chroma difference values,the first residual samples, the correlation information, and theprediction information from a bitstream (S1210). The predictor 440 maygenerate (or reconstruct) predicted samples (predictive block) of thefirst chroma component and predicted samples of the second chromacomponent on the basis of the prediction information (S1220).

The chroma component reconstruction unit 710 may determine whether toderive the second residual samples from the first residual samples(i.e., whether to activate and/or apply the second-residual-samplederivation method) (S1230). A detailed description of operation S1230will be described below through a separate embodiment.

The chroma component reconstruction unit 710 may modify the firstresidual samples using the correlation information when it is determinedto derive the second residual samples (S1240). Also, the chromacomponent reconstruction unit 710 may derive the second residual samplesusing the modified first residual samples and the inter-chromadifference values (S1250). The second residual samples may be derived byadding the modified first residual samples and the inter-chromadifference values.

Operation S1240 and operation S1250 may be performed through Equation 5below.Q(T(Cro_resi2))=(W*Q(T(Cro_res1))+Offset)+Q(T(Cro_r))  [Equation 5]

The inverse quantizer 420 may inversely quantize the first residualsamples and the derived second residual samples and may inverselytransform the inversely quantized first residual samples and theinversely quantized second residual samples (S1260). The adder 450 mayreconstruct the chroma block of the first chroma component by adding theinversely transformed first residual samples and the prediction block ofthe first chroma component and may reconstruct the chroma block of thesecond chroma component by adding the inversely transformed secondresidual samples and the prediction block of the second chroma component(S1270).

Embodiment 2

Embodiment 2 is a method of predicting and deriving second residualsamples using correlation information without using inter-chromadifference values.

Embodiment 2 is different from Embodiment 1 in that inter-chromadifference values are not used and a process of deriving theinter-chroma difference values (S640 or S1050) is not performed.

Except for this distinctions, the remaining processes of Embodiment 1may also be performed in Embodiment 2. Accordingly, as in Embodiment1-1, a process of deriving correlation information in a video encodingapparatus may be performed before a step of transforming residualsamples, and a process of deriving second residual samples in a videodecoding apparatus may be performed after a step of inverselytransforming residual samples. Also, as in Embodiment 1-2, a process ofderiving correlation information in a video encoding apparatus may beperformed after a step of quantizing residual samples, and a process ofderiving second residual samples in a video decoding apparatus may beperformed before a step of inversely quantizing residual samples.However, the remaining steps except for the step oftransforming/quantizing residual samples and the step of inverselyquantizing/inversely transforming residual samples will be describedbelow.

FIGS. 13 and 14 show flowcharts illustrating an example for Embodiment2.

The subtractor 130 may subtract the prediction block of the first chromacomponent and the chroma block of the first chroma component to acquirethe first residual samples and may subtract the prediction block of thesecond chroma component and the chroma block of the second chromacomponent to acquire the second residual samples (S1310).

The chroma component predictor 510 may determine whether to derive thesecond residual samples from the first residual samples (S1320). Thechroma component predictor 510 may derive the correlation informationusing the first residual samples and the second residual samples when itis determined to derive the second residual samples (S1330).

Meanwhile, depending on the embodiment, the second-residual-samplederivation method may include the following three modes when onlymultiplication information is included in the correlation information.

1) Mode 1: The values of the Cb residual samples are signaled, and thevalues of the Cr residual samples are derived by applying amultiplication factor of −½ or +½ to the values of the Cb residualsamples.

2) Mode 2: The values of the Cb residual samples are signaled, and thevalues of the Cr residual samples are derived by applying amultiplication factor of −1 or +1 to the values of the Cb residualsamples.

3) Mode 3: The values of the Cr residual samples are signaled, and thevalues of the Cb residual samples are derived by applying amultiplication factor of −½ or +½ to the values of the Cr residualsamples.

Also, the second-residual-sample derivation method may further includemodes in which an offset factor is applied to each of the first to thirdmodes when the offset information is also included in the correlationinformation.

In this embodiment, the chroma component predictor 510 may determine amode having the best rate distortion characteristic among the abovemodes as a mode for the chroma block. The chroma component predictor 510may integratedly perform a process of determining one of theabove-described general method and the second-residual-sample derivationmethod and a process of determining one of the modes of thesecond-residual-sample derivation method. For example, the chromacomponent predictor 510 may determine a mode or method having the bestrate distortion characteristic, among the general method and the modesin the second-residual-sample derivation method, for the chroma block.

The first residual samples, the correlation information, and theprediction information may be encoded and signaled to the video decodingapparatus (S1340). Here, the second residual samples and theinter-chroma difference values are not signaled.

The entropy decoder 410 may decode the first residual samples, thecorrelation information, and the prediction information from a bitstream(S1410).

The predictor 440 may generate (or reconstruct) the predicted samples(predictive block) of the first chroma component and the predictedsamples of the second chroma component on the basis of the predictioninformation (S1420).

The chroma component reconstruction unit 710 may determine whether toderive the second residual samples from the first residual samples(i.e., whether to activate and/or apply the second-residual-samplederivation method) (S1430). A detailed description of operation S1430will be described below through a separate embodiment.

The chroma component reconstruction unit 710 may derive the secondresidual samples by applying the correlation information to the firstresidual samples when it is determined to derive the second residualsamples (S1440). For example, when the correlation information includesthe multiplication information, the second residual samples may bederived by applying a multiplication factor indicated by themultiplication information to the first residual samples. As anotherexample, when the correlation information includes the multiplicationinformation and the offset information, the second residual samples maybe derived by applying an offset factor indicated by the offsetinformation to the first residual samples to which the multiplicationfactor is applied.

Operation S1440 may be performed through Equation 6 below.Cro_resi2=W*Cro_resi1+Offset  [Equation 6]

Comparing Equation 6 to Equations 2 to 5, it can be seen that, inEmbodiment an inter-chroma difference value is not used (i.e., Cro_r=0).Accordingly, a transform/inverse transform process, aquantization/inverse quantization process, and an encoding/decodingprocess are not performed on the inter-chroma difference values.

The adder 450 may reconstruct the chroma block of the first chromacomponent by adding the first residual samples and the prediction blockof the first chroma component and may reconstruct the chroma block ofthe second chroma component by adding the derived second residualsamples and the prediction block of the second chroma component (S1450).

Embodiment 3

Embodiment 3 is a method of determining whether to derive the secondresidual samples from the first residual samples (i.e., whether to allow(activate) and/or apply the second-residual-sample derivation method).

Whether to perform the second-residual-sample derivation method may bedetermined by various criteria. The various criteria may include 1) avalue of a syntax element (e.g., flag) indicating whether to allowand/or apply the derivation of the second residual samples (i.e.,whether it is on or off), 2) a prediction mode of a target block, 3) therange of a luma component value, etc.

Criterion 1: Syntax Element Indicating On/Off

A first syntax element and/or a second syntax element may be employed inorder to indicate whether to derive the second residual samples.

The first syntax element, which is a syntax element indicating whetherto allow (or activate) the second-residual-sample derivation method(i.e., whether it is on or off), may be defined at various positions ofa bitstream and signaled to the video decoding apparatus. For example,the first syntax element may be defined and signaled at the level of CTUor higher or may be defined and signaled at one or more of unit block(PU, TU, CU) levels, tile level, tile group level, and picture level.

The second syntax element, which is a syntax element indicating whetherto apply the second-residual-sample derivation method to a target block(chroma block) (i.e., whether it is on or off for the target block), maybe defined at various positions of a bitstream and signaled to the videodecoding apparatus. For example, the second syntax element may bedefined and signaled at the level of CTU or higher or may be defined andsignaled at one or more of unit block (PU, TU, CU) levels, tile level,tile group level, and picture level.

According to an embodiment, the first syntax element may be defined andsignaled at a relatively higher level in the bitstream, and the secondsyntax element may be defined and signaled at a relatively lower levelin the bitstream. In this case, the second syntax element may not besignaled at the lower level when the second-residual-sample derivationmethod is switched off at the higher level, and whether to switch on oroff at the lower level may be selectively determined even when thesecond-residual-sample derivation method is switched on at the higherlevel. Therefore, it is possible to improve the bit efficiency for thesecond-residual-sample derivation method.

FIG. 13 shows an example of determining whether to switch on or off thesecond-residual-sample derivation method.

The video encoding apparatus may determine whether thesecond-residual-sample derivation method is allowed and may set a valueof the first syntax element based on a result of the determination andsignal the first syntax element to the video decoding apparatus. Also,the video encoding apparatus may determine whether thesecond-residual-sample derivation method is applied and may set a valueof the second syntax element a result of the determination and signalthe second syntax element to the video decoding apparatus.

The video decoding apparatus may decode the first syntax element from abitstream (S1510) and may determine whether the second-residual-samplederivation method is allowed according to a value of the first syntaxelement (S1520).

When the second syntax element indicates that the second-residual-samplederivation method is allowed (i.e., first syntax element=1; S1520), thevideo decoding apparatus may decode the second syntax element from thebitstream (S1530). Also, the video decoding apparatus may determinewhether the second-residual-sample derivation method is appliedaccording to a value of the second syntax element (S1540).

When the second syntax element indicates that the second-residual-samplederivation method is applied to the target block (i.e., second syntaxelement=0; S1540), the video decoding apparatus may derive the secondresidual sample on the basis of the correlation information and thefirst residual samples (or the correlation information, the firstresidual samples, and the inter-chroma difference values) for the targetblock (S1550).

The derivation of the second residual sample is not performed for thetarget block when the first syntax element indicates that thesecond-residual-sample derivation method is not allowed in operationS1520 (i.e., first syntax element=0) or when the second syntax elementindicates that the second-residual-sample derivation method is notapplied in operation S1540.

Criterion 2: Prediction Mode of Target Block

Whether to switch on or off the second-residual-sample derivation methodmay be adaptively determined in consideration of or according to theprediction mode of the target block (chroma block).

For example, when the chroma block is predicted in one mode among anintra mode, an inter mode, an IBC mode, and a palette mode, thederivation of the second residual samples may be switched on or off. Asanother example, when the chroma block is predicted in two or more modesamong an intra mode, an inter mode, an IBC mode, and a palette mode(when the chroma block is predicted in one of the two or more modes),the derivation of the second residual samples may be switched on or off.

As still another example, when the chroma block is predicted through across-component linear model (CCLM) or a direct mode (DM) among intraprediction modes, the derivation of the second residual samples may beswitched on or off. In this case, information indicating the switchingon or off of the derivation of the second residual samples may besignaled to the video decoding apparatus only when the chroma block ispredicted through CCLM or DM.

As still another example, when the chroma block is predicted through abi-prediction mode or a merge mode among inter prediction modes and whenthe chroma block is predicted with reference to a zeroth referenceimage, the derivation of the second residual sample may be switched onor off. Information indicating the switching on or off of the derivationof the second residual samples may be signaled to the video decodingapparatus only when the chroma block is predicted through abi-prediction mode or a merge mode or only when the chroma block ispredicted with reference to a zeroth reference image.

An example of considering the prediction mode of the chroma block may becombined with the above example of using the first syntax element andthe second syntax element. For example, in operation S1520, when thefirst syntax element equals to 1 and the prediction mode of the chromablock corresponds to a prediction mode in which the derivation of thesecond residual samples is switched on, the second syntax element may bedecoded from a bitstream. (S1530). That is, whether to decode the secondsyntax element may be determined in consideration of the prediction modeof the chroma block.

Criterion 3: Range of Values of Luma Component

The range of values of a luma component (the range of luminance values)may be divided into two or more sections and, depending on which sectionthe values of the luma component of the target block belong to among thedivided sections, whether to apply the second-residual-sample derivationmethod may be determined.

For example, in the case where the range of the values of the lumacomponent is divided into two sections (the first section and the secondsection), the second-residual-sample derivation method may not beapplied when the values of the luma component of the target block belongto the first section, and the second-residual-sample derivation methodmay be applied when values of the luma component of the target blockbelong to the second section, and vice versa.

Among the two or more sections, a section to which thesecond-residual-sample derivation method is not applied may correspondto a “visual perception section” to which a user's vision can reactsensitively, and a section to which the second-residual-samplederivation method is applied may not correspond to the “visualperception section.” Accordingly, instead of being applied to the visualperception section, the second-residual-sample derivation method may beselectively applied only to sections other than the visual perceptionsection, and thus it is possible to prevent deterioration of subjectimage quality.

One or more section value of a section value indicating the range of thefirst section and a section value indicating the range of the secondsection may be signaled from the video encoding apparatus to the videodecoding apparatus. Depending on the embodiment, the section value maybe preset between the video encoding apparatus and the video decodingapparatus without signaling.

Criterion 4: Quantization Result

When quantization coefficients for the second residual samples have verysmall values (i.e., when a small number of quantization coefficientsoccur or exist) due to the accuracy of the prediction of the secondchroma component, the second-residual-sample derivation method may beselectively applied. Also, in this case, the quantization of only apart, not the whole, of the second residual sample may not be omitted(i.e., only some of the second residual samples are signaled).

Other Criteria

When the delta-QP (DQP) of the luma component is greater than or equalto a preset value, when a transform skip mode is not applied to a chromablock, or when a block differential coded modulation (BDPCM) mode is notapplied to a target block, the second-residual-sample derivation methodmay or may not be applied to the target block.

When a picture including the target block is a gradual random access(GRA) picture or an instantaneous decoding recoding (IDR) picture forrandom access, the second-residual-sample derivation method may not beapplied to the target block.

Although exemplary embodiments of the present invention have beendescribed for illustrative purposes, those skilled in the art willappreciate that and various modifications and changes are possible,without departing from the idea and scope of the invention. Exemplaryembodiments have been described for the sake of brevity and clarity.Accordingly, one of ordinary skill would understand that the scope ofthe embodiments is not limited by the embodiments explicitly describedabove but is inclusive of the claims and equivalents thereto.

What is claimed is:
 1. A video decoding apparatus for reconstructing achroma block of a target block to be reconstructed, the video decodingapparatus comprising: a decoder configured to decode correlationinformation between first residual samples and second residual samples,the first residual samples, and prediction information of the chromablock from a bitstream, wherein the first residual samples are residualsamples of a first chroma component and the second residual samples areresidual samples of a second chroma component; a predictor configured togenerate predicted samples of the first chroma component and predictedsamples of the second chroma information on the basis of the predictioninformation; a chroma component reconstruction unit configured to derivethe second residual samples by applying the correlation information tothe first residual samples; and an adder configured to reconstruct achroma block of the first chroma component by adding the first residualsamples and the predicted samples of the first chroma component andconfigured to reconstruct a chroma block of the second chroma componentby adding the second residual samples and predicted samples of thesecond chroma component.
 2. The video decoding apparatus of claim 1,wherein the first chroma component is one of a Cb chroma component and aCr chroma component, and the second chroma component is the other of theCb chroma component and the Cr chroma component.
 3. The video decodingapparatus of claim 1, wherein the decoder is further configured todecode the correlation information from a picture level of thebitstream.
 4. The video decoding apparatus of claim 1, wherein thecorrelation information includes multiplication information forindicating a multiplication factor between the first residual samplesand the second residual samples, and the chroma component reconstructionunit is further configured to derive the second residual samples byapplying a multiplication factor indicated by the multiplicationinformation to the first residual samples.
 5. The video decodingapparatus of claim 4, wherein the correlation information furtherincludes offset information for indicating an offset factor between thefirst residual samples and the second residual samples, and the chromacomponent reconstruction unit is configured to derive the secondresidual samples by applying an offset factor indicated by the offsetinformation to the first residual samples to which the multiplicationfactor is applied.
 6. The video decoding apparatus of claim 1, whereinthe decoder is further configured to: decode a first syntax elementindicating whether derivation of the second residual samples is allowedfrom a sequence parameter set (SPS) level of the bitstream; and decode asecond syntax element indicating whether the derivation of the secondresidual samples is applied to the chroma block from a level lower thanthe SPS level in the bitstream when the first syntax element indicatesthat the derivation of the second residual samples is allowed, and thechroma component reconstruction unit is further configured to derive thesecond residual samples when the second syntax element indicates thatthe derivation of the second residual samples is applied to the chromablock.
 7. The video decoding apparatus of claim 6, wherein the decoderis further configured to decode the second syntax element inconsideration of a predictive mode of the target block.